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Abstract. We show that a random set of integers with density has almost 
always more differences than sums. 



For a set A C Z set A + A = {ai + a2 ■■ Ai £ A}, and A- A ^ {ai - a2 : ai e A}. 
A finite set A is called difference dominant, if|A — A| > + and sum dominant, 
if |j4 — j4| < + Nathanson[2j constructed infinite sequences of sum dominant 
sets, and stated the opinion that the majority of all subsets of [l,n] is difference 
dominant. However, Martin and O'BryantT showed that the proportion of sum 
dominant sets is at least 2 • 10^^. They conjectured that sets of density are almost 
always difference dominant. In this note we prove this conjecture. More precisely, 
we have the following. 

Theorem 0.1. Let p„ be a sequence of real numbers with p„ e [0, 1], p„ and 

npn ^ oo. Let 0<i<n~lbe independent random variables satisfying 
P{Cin = 1) = Pn, o,nd set An — {i '■ ^in — !}• Then the probability that A„ is 
difference dominant tends to L 

Martin and O'Bryant noted that for p„ = o{n^^^^), this theorem follows from 
the fact that in this case almost every set is a Sidon set and has therefore almost 
twice as many differences as sums. 

Proof. We shall suppress the subscript n throughout our argument. 

To simplify the computations we first deal with the case p — o{nr^l'^\ The 
number of elements of A is asymptotically normal distributed with mean and vari- 
ance np^ while the expected number of solutions of the equation a; + y = u + w with 
x,y,u,v G A is 0{n^p'^). Hence, with probability tending to 1, we have 

lA - A| > |A|(|yl| - 1) - \{{x,y,u,v e A"^ : x ~ y = u ~ v} 

> (I - e){npf - e-'n^p'' > (1 - 2e)inp)^ > >\A + A|, 

and our claim follows. Hence, from now on we shall assume that p > cnT^I"^ . Define 
random variables Cii, Qii as 

^ 3a, & : a + 6 = i,^a = 6 = 1 ^ 3a, : a - 6 = i, = 6 = 1 

[0, otherwise ' [0, otherwise 

and set Sj — 'Yliiei Cj»- Then the probability of A to be difference dominant equals 
the probability of the event 52 > ^i. We first compute the expectation of Sj. 
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Noting that an even integer can be represented as the sum of two different integers 
or as the double of an integer, we obtain 



2n-2 



E5i = ^Ci 

l)/2) 



= (l-(l-p2)Ln/2J)) + 2 (I-(I-P')^'"'^ 

0<i<n-2 
2\i 

+2 (l_(l_p)(l_p2)V2-l) 

0<i<ra-2 
2|i 

Jo 

1 _ (1 _ b2)"/2 

= ^"- -los(l-V)./^ +°<" 
= 2n- ^ ^(^^/^ +0(1), 



and similarly 



= E C2. 

i——n-\-l 




n-l 




= i + 2E(i- 




= 2 / 







Since p:> n we obtain E ^2 - E Si > p 
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Next, we give an upper bound for the variance of Sj. We have 



E 52 = (E S,f + 2 Y {nCnQjk = 1) - P{Cn = l)P{Qk = 1)) 

+E^(o-^ = i)-'P(o. = i)' 

= (E S^Y + ^ Var + 2 ^ (i^lO.G/c = 1) - P{Cn = l)P{Cjk = I)] 

i i<k 

Var Sj = ^ Var C, + 2 ^ {P{CjiQk = 1) - = l)P{Cjk = 1)) • 



Our aim is to show that Var Sj = o{p~'^) for j = 1, 2, our claim then follows from 

Chebyshcv's inequality together with our estimate for E 6*2 — E 5*1 . 

Obviously, the first term on the right-hand side is already of the right magnitude, 
that is, it remains to bound the correlation of (jk and Cjj for i < k. 
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Clearly, the correlation of (.ji and Qk is non-negative, that is, it suffices to 
bound every summand from above. We use two different estimates for P{CjiCjk = 
1) — P{Cji = l)P{Cjk = 1) depending on whether P{Cjk = 1) is close to 1 or not. 
First, we have 

PiCjiCjk = 1) - = i)PiQk = 1) < P{Q^ = i)(i - PiCjk = 1)). 

On the other hand, if j = 1 and i < k < n, then 

PiCuCik = 1) - P{Cu = i)P(Cife = 1) < 

P(3/i, v, K : ^ii^ = = = I, H + v = i, IJ. + K = k) < ip^. 
Similarly, if i < n < fc, then 

PiCuCik = 1) - P{Cu = l)-P(Cife = 1) < max(0, i + n- k)p^. 
Hence, we have to show that the sum 

V- ... 3 

E mm(»/, ' ^2 ) 

i<k<-n 

is of order o{p~^). For each k we either use the first or the second estimate for all 
i, and obtain 



^ min (k'^p^, — 5^^) < min (k^p 

k<n ^ ^ 



Putting fco = 7p~^logp~^, the second term becomes o(l), while the first one is 

since p ^ 0, which is of the desired size. A similar 
computation shows that 5*2 has variance o{p^'^), and we conclude that the random 
variable 52 — Si has mean and variance o(p~^), together with p ^ our claim 
follows. □ 
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